NSF PAR Search | NSF Public Access Repository

What Should We Engineer in Prompts? Training Humans in Requirement-Driven LLM Use

https://doi.org/10.1145/3731756

Ma, Qianou; Peng, Weirui; Yang, Chenyang; Shen, Hua; Koedinger, Kenneth; Wu, Tongshuang (April 2025, ACM Transactions on Computer-Human Interaction)

Prompting LLMs for complex tasks (e.g., building a trip advisor chatbot) needs humans to clearly articulate customized requirements (e.g., “start the response with a tl;dr”). However, existing prompt engineering instructions often lack focused training on requirement articulation and instead tend to emphasize increasingly automatable strategies (e.g., tricks like adding role-plays and “think step-by-step”). To address the gap, we introduce Requirement-Oriented Prompt Engineering (ROPE), a paradigm that focuses human attention on generating clear, complete requirements during prompting. We implement ROPE through an assessment and training suite that provides deliberate practice with LLM-generated feedback. In a randomized controlled experiment with 30 novices, ROPE significantly outperforms conventional prompt engineering training (20% vs. 1% gains), a gap that automatic prompt optimization cannot close. Furthermore, we demonstrate a direct correlation between the quality of input requirements and LLM outputs. Our work paves the way to empower more end-users to build complex LLM applications.
Free, publicly-accessible full text available April 24, 2026
The extreme depletion of ionospheric electron density and its hemispheric asymmetry during the May 2024 storm

https://doi.org/10.1093/nsr/nwaf307

Chen, Yanhong; Aa, Ercha; Yuan, Tianjiao; Zhang, Shunrong; Shen, Hua; Yue, Xinan; Xu, Heng; Liu, Siwei; Wang, Xin; Huang, Wengeng; et al (August 2025, National Science Review)

The Earth's ionosphere plays a critical role in radio wave transmission, reflection, and scattering, directly affecting communication, navigation, and positioning systems. However, the comprehensive impacts of space weather remain to be fully established in cases where the ionosphere experiences strong disturbances during geomagnetic storms. We reported unprecedented observational evidence of extreme ionospheric electron density depletion and its hemispheric asymmetry during the May 10–12, 2024 super geomagnetic storm, utilizing multi-instrument ground-based and spaceborne in-situ observations. The ionospheric electron density significantly decreased, with a maximum reduction of 98% over the whole northern hemisphere for more than 2 days, causing backscatter echo failures in multiple ionosondes within the Chinese Meridian Project (CMP) monitoring network. In contrast, mid-to-low latitude regions in the southern hemisphere exhibited electron density enhancements. Thermosphere-Ionosphere-Electrodynamics General Circulation Model (TIEGCM) simulations demonstrated strong consistency with northern hemispheric observations. The vertical drift and the column integrated ratio of O and N₂ (ΣO/N₂) from observations and simulations indicated the deep reduction of total electron content (TEC) mainly generated by severe ion recombination associated with neutral composition changes that interacted with the disturbed electric field. The summer to winter neutral wind and asymmetry of O/N₂ were possibly responsible for the asymmetry in electron density between the northern and southern hemispheres. These results advance understanding of ionospheric storm physics by establishing causal links between magnetosphere-thermosphere coupling processes and extreme electron density variations, while providing critical observational constraints for space weather model refinement.
"Here the GPT made a choice, and every choice can be biased": How Students Critically Engage with LLMs through End-User Auditing Activity

https://doi.org/10.1145/3706598.3713714

Prabhudesai, Snehal; Kasi, Ananya Prashant; Mansingh, Anmol; Das Antar, Anindya; Shen, Hua; Banovic, Nikola (April 2025, ACM)

Free, publicly-accessible full text available April 25, 2026
Causally Modeling the Linguistic and Social Factors that Predict Email Response

https://doi.org/10.18653/v1/2025.naacl-long.594

Xu, Yinuo; Chen, Hong; Rakshit, Sushrita; Ananthasubramaniam, Aparna; Yadav, Omkar; Zheng, Mingqian; Jiang, Michael; Zhang, Lechen; Yi, Bowen; Alkiek, Kenan; et al (January 2025, Association for Computational Linguistics)

Full Text Available
How to Teach Programming in the AI Era? Using LLMs as a Teachable Agent for Debugging

https://doi.org/10.1007/978-3-031-64302-6_19

Ma, Qianou; Shen, Hua; Koedinger, Kenneth; Wu, Sherry Tongshuang (January 2024, Springer Nature Switzerland)

Large Language Models (LLMs) now excel at generative skills and can create content at impeccable speeds. However, they are imperfect and still make various mistakes. In a Computer Science education context, as these models are widely recognized as “AI pair programmers,” it becomes increasingly important to train students on evaluating and debugging the LLM-generated code. In this work, we introduce HypoCompass, a novel system to facilitate deliberate practice on debugging, where human novices play the role of Teaching Assistants and help LLM-powered teachable agents debug code. We enable effective task delegation between students and LLMs in this learning-by-teaching environment: students focus on hypothesizing the cause of code errors, while adjacent skills like code completion are offloaded to LLM-agents. Our evaluations demonstrate that HypoCompass generates high-quality training materials (e.g., bugs and fixes), outperforming human counterparts fourfold in efficiency, and significantly improves student performance on debugging by 12% in the pre-to-post test.
Full Text Available
ScatterShot: Interactive In-context Example Curation for Text Transformation

https://doi.org/10.1145/3581641.3584059

Wu, Sherry; Shen, Hua; Weld, Daniel S; Heer, Jeffrey; Ribeiro, Marco Tulio (March 2023, IUI '23: Proceedings of the 28th International Conference on Intelligent User Interfaces)

The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an LLM to their specific tasks with a small number of examples. However, users tend to include only the most obvious patterns when crafting examples, resulting in underspecified in-context functions that fall short on unseen cases. Further, it is hard to know when “enough” examples have been included even for known patterns. In this work, we present ScatterShot, an interactive system for building high-quality demonstration sets for in-context learning. ScatterShot iteratively slices unlabeled data into task-specific patterns, samples informative inputs from underexplored or not-yet-saturated slices in an active learning manner, and helps users label more efficiently with the help of an LLM and the current example set. In simulation studies on two text perturbation scenarios, ScatterShot sampling improves the resulting few-shot functions by 4-5 percentage points over random sampling, with less variance as more examples are added. In a user study, ScatterShot greatly helps users in covering different patterns in the input space and labeling in-context examples more efficiently, resulting in better in-context learning and less user effort.
Full Text Available
A Tale of Evil Twins: Adversarial Inputs versus Poisoned Models

Pang, Ren; Shen, Hua; Zhang, Xinyang; Ji, Shouling; Vorobeychik, Yevgeniy; Luo, Xiapu; Liu, Alex; Wang, Ting (January 2020, Proceedings of the ACM Conference on Computer and Communications Security)

Despite their tremendous success in a range of domains, deep learning systems are inherently susceptible to two types of manipulations: adversarial inputs -- maliciously crafted samples that deceive target deep neural network (DNN) models, and poisoned models -- adversely forged DNNs that misbehave on pre-defined inputs. While prior work has intensively studied the two attack vectors in parallel, there is still a lack of understanding about their fundamental connections: what are the dynamic interactions between the two attack vectors? what are the implications of such interactions for optimizing existing attacks? what are the potential countermeasures against the enhanced attacks? Answering these key questions is crucial for assessing and mitigating the holistic vulnerabilities of DNNs deployed in realistic settings. Here we take a solid step towards this goal by conducting the first systematic study of the two attack vectors within a unified framework. Specifically, (i) we develop a new attack model that jointly optimizes adversarial inputs and poisoned models; (ii) with both analytical and empirical evidence, we reveal that there exist intriguing "mutual reinforcement" effects between the two attack vectors -- leveraging one vector significantly amplifies the effectiveness of the other; (iii) we demonstrate that such effects enable a large design spectrum for the adversary to enhance the existing attacks that exploit both vectors (e.g., backdoor attacks), such as maximizing the attack evasiveness with respect to various detection methods; (iv) finally, we discuss potential countermeasures against such optimized attacks and their technical challenges, pointing to several promising research directions.
Full Text Available
Mononuclear diploid cardiomyocytes support neonatal mouse heart regeneration in response to paracrine IGF2 signaling

https://doi.org/10.7554/eLife.53071

Shen, Hua; Gan, Peiheng; Wang, Kristy; Darehzereshki, Ali; Wang, Kai; Kumar, S Ram; Lien, Ching-Ling; Patterson, Michaela; Tao, Ge; Sucov, Henry M (March 2020, eLife)

Injury to the newborn mouse heart is efficiently regenerated, but this capacity is lost by one week after birth. We found that IGF2, an important mitogen in heart development, is required for neonatal heart regeneration. IGF2 originates from the endocardium/endothelium and is transduced in cardiomyocytes by the insulin receptor. Following injury on postnatal day 1, absence of IGF2 abolished injury-induced cell cycle entry during the early part of the first postnatal week. Consequently, regeneration failed despite the later presence of additional cell cycle-inducing activities 7 days following injury. Most cardiomyocytes transition from mononuclear diploid to polyploid during the first postnatal week. Regeneration was rescued in Igf2-deficient neonates in three different contexts that elevate the percentage of mononuclear diploid cardiomyocytes beyond postnatal day 7. Thus, IGF2 is a paracrine-acting mitogen for heart regeneration during the early postnatal period, and IGF2-deficiency unmasks the dependence of this process on proliferation-competent mononuclear diploid cardiomyocytes.
Full Text Available
